Bit indexed explicit replication forwarding optimization

ABSTRACT

Various systems and methods for performing bit indexed explicit replication (BIER). For example, one method involves receiving a packet at a node. The packet includes a bit string. The node traverses the bit string and selects an entry in a bit indexed forwarding table (BIFT). The entry includes a forwarding bit mask. Based on the forwarding bit mask and the bit string, the node forwards the packet.

RELATED APPLICATIONS

This application is a continuation of U.S. application Ser. No.14/603,580, entitled “Bit Mask Forwarding Replication Optimization,”filed Jan. 23, 2015, which claims the domestic benefit under Title 35 ofthe United States Code § 119(e) of U.S. Provisional Patent ApplicationSer. No. 61/931,473, entitled “Bit Mask Forwarding Architectures forStateless Multipoint Replication,” filed Jan. 24, 2014, which is herebyincorporated by reference in its entirety and for all purposes as ifcompletely and fully set forth herein.

U.S. application Ser. No. 14/603,580, entitled “Bit Mask ForwardingReplication Optimization,” filed Jan. 23, 2015, is also acontinuation-in-part of U.S. application Ser. No. 14/488,790, entitled“Bit Indexed Explicit Replication Using Multiprotocol Label Switching,”filed Sep. 17, 2014, which in turn claims the domestic benefit underTitle 35 of the United States Code § 119(e) of U.S. Provisional PatentApplication No. 61/878,693, entitled “Multicast IPv6 with Bit MaskForwarding,” filed Sep. 17, 2013, and U.S. Provisional PatentApplication No. 61/931,473, entitled “Bit Mask Forwarding Architecturesfor Stateless Multipoint Replication,” filed Jan. 24, 2014. U.S.application Ser. No. 14/603,580, entitled “Bit Mask ForwardingReplication Optimization,” filed Jan. 23, 2015, is also acontinuation-in-part of U.S. application Ser. No. 14/488,761, entitled“Bit Indexed Explicit Replication,” filed Sep. 17, 2014, which in turnclaims the domestic benefit under Title 35 of the United States Code §119(e) of U.S. Provisional Patent Application No. 61/878,693, entitled“Multicast IPv6 with Bit Mask Forwarding,” filed Sep. 17, 2013, and U.S.Provisional Patent Application No. 61/931,473, entitled “Bit MaskForwarding Architectures for Stateless Multipoint Replication,” filedJan. 24, 2014. U.S. application Ser. No. 14/603,580, entitled “Bit MaskForwarding Replication Optimization,” filed Jan. 23, 2015, is also acontinuation-in-part of U.S. application Ser. No. 14/488,810, entitled“Bit Indexed Explicit Replication Using Internet Protocol Version 6,”filed Sep. 17, 2014, which in turn claims the domestic benefit underTitle 35 of the United States Code § 119(e) of U.S. Provisional PatentApplication No. 61/878,693, entitled “Multicast IPv6 with Bit MaskForwarding,” filed Sep. 17, 2013, and U.S. Provisional PatentApplication No. 61/931,473, entitled “Bit Mask Forwarding Architecturesfor Stateless Multipoint Replication,” filed Jan. 24, 2014. Each of thetwo provisional and four non-provisional applications referenced aboveis hereby incorporated by reference in their entirety and for allpurposes as if completely and fully set forth herein.

BACKGROUND OF THE INVENTION

Network nodes forward data. Network nodes may take form in one or morerouters, one or more bridges, one or more switches, one or more servers,or any other suitable communications processing device. The data iscommonly formatted as packets and forwarded using forwarding tables. Apacket is a formatted unit of data that typically contains controlinformation and payload data. Control information may include:information that identifies sources and destinations, such as addresses,error detection codes like checksums, sequencing information, etc.Control information is typically found in packet headers and trailers.Payload data is typically located between the packet headers andtrailers.

Forwarding packets involves various processes that, while simple inconcept, can be complex. The processes involved in forwarding packetsvary, depending on the type of forwarding method used. In many networks,multicast is the preferred method of data forwarding. One reason forthis is that multicast is a bandwidth-conserving technology that reducestraffic by simultaneously delivering data to multiple receivers.However, in traditional multicast systems, a relatively large amount ofcontrol plane information is used. Setting up and maintaining thiscontrol information has a tendency to become complex and costly in termsof computing resources, and can become a major limiting factor inoverall network performance. Another issue with multicast is that due topacket delivery mechanisms used, packets are sometimes forwarded tolocations where the packets were not desired. This unnecessary deliveryof packets represents an unwelcome burden on network performance.Overcoming this burden by traditional means involves generation andmaintenance of even more control plane information.

BRIEF DESCRIPTION OF THE DRAWINGS

The present disclosure may be better understood, and its numerousobjects, features, and advantages made apparent to those skilled in theart by referencing the accompanying drawings.

FIG. 1 is a simplified block diagram illustrating certain components ofan example network.

FIG. 2 is a simplified block diagram illustrating certain components ofan example network.

FIG. 3A is a flow chart illustrating an example process employed by anode, according to the present description.

FIG. 3B is a flow chart illustrating an example process employed by anode, according to the present description.

FIG. 4 is a simplified block diagram illustrating certain components ofan example network.

FIG. 5A is an example table generated by node, according to the presentdescription.

FIG. 5B is an example table generated by node, according to the presentdescription.

FIG. 5C is an example table generated by node, according to the presentdescription.

FIG. 6 is a flow chart illustrating an example process employed by anode, according to the present description.

FIG. 7 is a flow chart illustrating an example process employed by anode, according to the present description.

FIG. 8A is a flow chart illustrating an example process employed by anode, according to the present description.

FIG. 8B is a flow chart illustrating an example process employed by anode, according to the present description.

FIG. 9 is a flow chart illustrating an example process employed by anode, according to the present description.

FIG. 10 is a block diagram illustrating certain components of an examplenode that can be employed, according to the present description.

FIG. 11 is a block diagram depicting a computer system suitable forimplementing embodiments of the systems described herein.

FIG. 12 is a block diagram depicting a network device suitable forimplementing embodiments of the systems described herein.

DETAILED DESCRIPTION

Overview

Various systems and methods for performing bit indexed explicitreplication (BIER). For example, one method involves receiving a packetat a node. The packet includes a bit string. The node traverses the bitstring and selects an entry in a bit indexed forwarding table (BIFT).The entry includes a forwarding bit mask. Based on the forwarding bitmask and the bit string, the node forwards the packet.

Multicast

Multicast delivers multicast data packets (data packets thattraditionally include information identifying a multicast group, such asa multicast group address) from a source to multiple receivers withoutunduly burdening the source. As used herein, the term “receiver”signifies a host (such as a computing device or application) thatsubscribes to a multicast group. Instead of the source replicating amulticast data packet and sending a copy of the multicast data packet toeach receiver, the source sends a single copy of a multicast data packetand multicast-enabled routers (referred to herein simply as nodes)replicate the packet at the point(s) where paths to various receiversdiverge. Multicast routing protocols enable multicast transmission(i.e., one-to-many connections and many-to-many connections) byreplicating a multicast data packet close to the destination of thatmulticast data packet, obviating the use of multiple unicast connectionsfor the same purpose. This saves network bandwidth and improvesthroughput.

FIG. 1 is a simplified block diagram of a network 100 performingmulticast data transmission. Multicast-enabled nodes 110, 120, 130 and140 are coupled through network links 150, 160, and 170.Multicast-enabled node 110 is also coupled to source 111 and receiver112; multicast-enabled node 120 is coupled to receiver 121;multicast-enabled node 130 is coupled to receiver 131 and receiver 132;and multicast-enabled node 140 is coupled to receiver 141. Such couplingbetween the multicast-enabled nodes and the sources and/or receivers canbe direct or indirect (e.g., via a L2 network device or another node).

For the purposes of this illustration, source 111 is a host configuredto transmit multicast data packets to a multicast group that includes asreceivers hosts 112, 121, 131, 132 and 141. Source 111 transmits amulticast flow, consisting of one or more multicast data packets havinga common multicast group address, to multicast-enabled node 110(illustrated by the arrow from 111 to 110). Multicast-enabled node 110includes a multicast forwarding table that multicast-enabled node 110uses to determine where to forward the multicast data packets associatedwith the multicast flow. The multicast forwarding table includesinformation identifying each interface of multicast-enabled node 110that is connected to a multicast distribution tree (MDT) to one or morereceivers for the multicast group (e.g., a host that has sent a joinmessage, as described above). Multicast-enabled node 110 then replicatesmulticast data packets in the multicast flow and transmits thereplicated multicast data packets from the identified interfaces toreceiver 112, multicast-enabled node 120, and multicast-enabled node130.

Multicast-enabled nodes 120 and 130 inform node 110 that they arecoupled to one or more receivers using join messages, for example, aprotocol independent multicast (PIM) join message. In response toreceiving the join messages, multicast-enabled node 110 updates itsmulticast forwarding tables to identify interfaces to which multicastdata packets should be forwarded. The multicast data packets can bereplicated by node 110 as needed in order to provide the multicast datapackets to receivers for the multicast group (e.g., receivers 131 and132) and other multicast-enabled nodes on the MDT (e.g.,multicast-enabled node 140). In this manner, a multicast flow fromsource 111 can be transmitted through a multicast network to multiplereceivers.

As can be seen, the process traditionally used in multicast of settingup MDTs and updating multicast forwarding tables for each group resultsin considerable amounts of state information within the network. Themulticast forwarding tables maintained by each multicast-enabled node,in particular, can become quite large. Maintaining such multicastforwarding tables represents limitations on network scalability.

Bit Indexed Explicit Replication

As described below, techniques are used to attach receiver informationto packets in the form of bits and forward the packets based on thereceiver information. This greatly reduces the amount of stateinformation stored at nodes and is therefore also referred to as“stateless multicast.” More formally, the term Bit Indexed ExplicitReplication (BIER) is used to describe these techniques. As suggested bythe term, a bit position is used as an index into a forwarding table andpackets are replicated only to specified nodes. BIER enables packets tobe forwarded from a source to multiple receivers without the use ofmulticast distribution trees and per-group state information at eachnode between the source and receivers.

FIG. 2 shows an example network 200. Network 200 includes BIER-enablednodes 206-218. BIER-enabled nodes are configured to forward packetsusing BIER, and are referred to as bit forwarding routers (BFRs). BFRs206-218 form a provider network, or domain. Such a provider networkcould be employed by an Internet service provider to transport packetsto customers. The domain includes transit nodes 208 and 210, andprovider edge nodes 206, 214, 216, and 218. The provider edge nodes arecoupled to customer edge nodes 211, 213, 215, and 217. Hosts 201, 203,205, and 207 are computing devices coupled to the customer edge nodes.

Each of the BFRs 206-218 has interfaces that are identified as shown.For example, BFR 208 has three interfaces designated 1-3, respectively.Each BFR is assigned a unique identifier or routable address known as arouter identifier (RID). The RID can be implemented as, for example, aninternet protocol (IP) address, a prefix, or a loopback address. EachBFR advertises or floods the routable address to all other BFRs innetwork 200. Each BFR builds a unicast topology of the BFRs in network200 using the advertised routable addresses. In one embodiment, therouter identifier can be mathematically converted to the set identifierand bit position assigned to a BFR. The conversion depends on the lengthof bit string being used. For example, to convert a router identifier‘N’ to a set identifier and bit position, the set identifier is theinteger part of the quotient (N−1)/BitStringLength. The bit position is((N−1) modulo BitStringLength)+1. If, in the above example, N equals 257and the BitStringLength is 256, the SI is 1 and the BP is 1. BIERnetwork 200 also includes a node configured to operate as a multicastdata controller (MDC) 230. The MDC performs configuration andadministrative tasks, as described below.

BFR 206 is configured as a bit forwarding ingress router (BFIR) formulticast data packets. BFR 206 is coupled, via customer edge node 211,to source 201. Multicast data packets from source 201 enter the BIERnetwork via the BFIR (BFR 206). Each of BFRs 214, 216, and 218 isconfigured as a bit forwarding egress router (BFER). The BFERs can beconnected (directly or via customer edge routers) to hosts, such asreceivers, or other networks. A BFER is a BFR that is the last BFR on apath between a source and a receiver. The BFER may be a provider edge(PE) node that is coupled to the receiver either directly or indirectly(e.g., through a non-BIER-enabled CE node).

Assigning a Bit Position in the Bit String

Each BFER in a BIER network is assigned a bit position (BP) from a setor array of bits. The array of bits can be carried in a packet or othernetwork message. The array of bits can also be stored in forwardingand/or routing tables. For the sake of clarity, the terms used hereinare “bit string” (when the array of bits is in a packet) and “bit mask”(when the array of bits is stored in a table). Also, it is noted thatBFIRs can act as BFERs and vice versa. BFIRs are also assigned bitpositions.

The bit string (or bit mask) can have a fixed or variable length. Thelength of the bit string used in the BIER network can be staticallyconfigured or dynamically assigned, and is distributed through the BIERnetwork. In one embodiment, the length of the bit string is between 256and 1024 bits, though shorter or longer bit strings can be used. Themaximum length of the bit string value is determined, in one embodiment,by hardware or software limitations of the BFRs in the BIER network. Inone embodiment, different BFRs in the BIER network use different lengthsfor their respective bit strings. For example, one BFR may have amaximum bit string length of 128 bits while another BFR may have amaximum bit string length of 256 bits. A bit string is one type ofmulticast forwarding entry in which each bit position of multiple bitpositions is an element that can be used to represent an individual nodeor interface. Other types of multicast forwarding entries with othertypes of elements can be used.

A bit position (BP) is statically or dynamically assigned to each BFER.Each BFER should have at least one unique bit position from the bitstring. In one embodiment, a central authority, such as a multicastdomain controller, will assign the BPs to the BFERs. The multicastdomain controller, in one embodiment, assigns multiple BPs to a singleBFER, e.g., a unique BP for each of one or more interfaces included inthe BFER. Other mechanisms for assigning BPs can be implemented as well,such as deriving a BP from a router identifier assigned to a BFR, wherethe derivation utilizes a mapping algorithm. In some embodiments, a bitposition in the bit string is assigned to a single BFER. In otherembodiments, a single BP can be assigned to more than one BFER. Whenmultiple BFERs are assigned the same BP, one of the multiple BFERs canassume ownership of the BP at a given time, and ownership can betransferred between the multiple BFERs. Ownership of the BP can betransferred to another one of the multiple BFERs for any of severalreasons, such as a failover in response to a node or link failure, or ifone of the multiple BFERs otherwise becomes unavailable, in response tochanging network conditions, due to time-sharing considerations, and thelike. Assigning one BP to multiple BFERs facilitates operation similarto anycast, in which packets are forwarded to one receiver of a group ofreceivers, where each receiver in the group of receivers uses a commonaddress.

Only the BIER-enabled edge nodes (e.g., BFERs and BFIRs) in a BIERnetwork are assigned a BP. All other BFRs in the network (e.g., transitnodes) don't need a BP to participate in BIER. This helps to reduce thenumber of bits assigned in a network. As shown in the example of FIG. 2,network 200 utilizes a four bit long bit string. Each of the four BFERs(including BFIR node 206) in network 200 is assigned a BP: node 206 isassigned BP {1000}; node 214 is assigned BP {0001}; node 216 is assignedBP {0100}; and node 218 is assigned BP {0010}.

Sets

The number of BFERs that can be addressed (assigned a BP) is limited bythe size of the bit string included in the multicast data packet. If thebit string is four bits long, the number of BFERs that can be addressedis four. The concept of sets allows an increase in the number of BFERsthat can be assigned BPs. The set identifier (SI) is, for example, anumber between 0 and 255. The SI allows a BP to be unique in the contextof a set. For example, each BP can be re-used in each set. In anembodiment with 256 sets and a bit string length of 256 bits, 65536(256×256) BFERs can be supported. In one embodiment, BFRs in the BIERnetwork generate separate forwarding information for each SI. Forexample, if two different set identifiers are in use in the BIERnetwork, the BFRs generate two bit indexed forwarding tables (BIFTs),one corresponding to each SI. In response to receiving a multicast datapacket having a SI, the BFR uses the SI to select which forwardinginformation (e.g., BIFT) to use to forward the multicast data packet.

Signaling

When a receiver (e.g., a host, such as host 203 of FIG. 2) wishes tojoin a multicast group, the receiver sends membership information (e.g.,using Internet Group Management Protocol (IGMP)) to the BFER thereceiver is coupled to (either directly or indirectly). The membershipinformation identifies the multicast group the host wishes to join andoptionally identifies a source associated with the multicast group. Inthe example of FIG. 2, host 203 can send an IGMP message to CE node 213and CE node 213 can then forward the IGMP message to BFR 214. Inresponse to receiving a message indicating a receiver wishes to join amulticast group, the BFER signals its interest in the multicast groupidentified in the message. This involves, in one embodiment, the BFERsending a membership message to any BFIRs in the network, or to acontroller, indicating the BFER's interest in the multicast group andincluding the BFER's BP. In one embodiment, BFERs use border gatewayprotocol (BGP) to send membership messages.

The BFER can send the membership message to a subset (less than all) orto all of the BFRs on the edge of the BIER network (BFIRs and BFERs) orcan flood the signaling message to all nodes in the network. Forexample, if the network is using source-specific multicast (SSM), theBFER knows the source of the multicast group (e.g., from the IGMPmessage received from the receiver) and can look up a path to thespecified BFIR and send the signaling message to that BFIR only. If SSMis not the type of multicast being used, the BFER can flood thesignaling message to all candidate BFIRs. Only BIER-enabled edge nodesparse the message to determine group and BP information, all other nodescan discard the message. Receivers joining and unsubscribing frommulticast groups do not create churn or require any changes in the stateinformation (e.g., bit indexed forwarding tables) maintained by the coreBFRs, unlike with traditional multicast. Instead, join or unsubscribemessages signal BFIRs to update the bit string associated with a givenmulticast group. This involves only the BFIRs updating state information(e.g., updating a group membership table associated with the group) andnot the core nodes. This represents a significant improvement overtraditional multicast, in which multicast distribution trees are set upand torn down throughout the network based on the join and unsubscribemessages.

Each BFIR, such as BFR 206 of FIG. 2, maintains state information thatincludes an entry for each multicast group for which the BFIR hasreceived a membership message. In one embodiment, the BFIR maintains thestate in a group membership table (GMT), as shown at 224 of FIG. 2. Inone embodiment, each entry in the GMT includes information identifyingthe multicast group (such as a multicast group name and/or an address ofa source for the multicast group), a list of BPs corresponding to BFERsthat have expressed interest (e.g., via a membership message) in themulticast group identified in the group field, and a bit string, whichidentifies all of the BFERs that have expressed interest in themulticast group (e.g., by having a bit set in the bit positioncorresponding to each BFER that has expressed interest in the multicastgroup). In response to receiving a membership message from a BFERindicating that the BFER is interested in a multicast group, the BFIRsets the bit corresponding to the BFER's BP in the bit string thatcorresponds to the multicast group. The membership message includes, inone embodiment, the BFER's BP. Alternatively, the BFIR performs a lookupto determine the BP associated with the BFER that originated themembership message using the address of the BFER that originated themembership message and the information advertised via IGP, e.g.,information in the BFIR's BIRT. When the BFER is no longer interested inreceiving multicast data packets for the multicast group, the BFERsignals to the BFIR, e.g., using an unsubscribe message, and the BFIRclears the corresponding bit in the bit string.

The BIER network forwards multicast data packets through the BIERnetwork based on the bit string. The BFIR transmits the bit string alongwith multicast data packets into the BIER network. There are number ofdifferent techniques available for transmitting the bit string. Thisdescription refers to encapsulating the bit string into the multicastdata packet. This terminology covers not only incorporating the BM intothe multicast data packet (e.g., as header or payload information), butalso appending or prepending some or all of the bit string to themulticast data packet.

FIG. 3A is a flowchart that illustrates the operations discussed above.In one embodiment, the method is performed by a BFER, such BFR 214 ofFIG. 2. At 302, the BFER requests BIER information, such as a bitposition and set identifier. Requesting BIER information involves, in onembodiment, the BFER sending a message to a multicast domain controller,such as multicast domain controller 230 of FIG. 2. In one embodiment,the BIER information is automatically provided to the BFER in responseto detecting the BFER has joined the network, or in response to someother condition. An administrator can manually configure the BFER with aBP and set identifier.

At 304, the BFER receives the BIER information, either as a result ofadministrative configuration, or, for example, included in a messagefrom the MDC in response to the request for BIER information. At 306, inresponse to the BFER receiving the BIER information, the BFER advertisesits BIER information and its router identifier, to some or all of theother nodes in the BIER network. In one embodiment, the BFER advertisesits BP and SI via an interior gateway protocol (IGP). For example,Intermediate System to Intermediate System (ISIS) and/or Open ShortestPath First (OSPF) can be modified to assist in distributing thisinformation through the BIER network using link state updates. Otherflooding mechanisms to distribute the information are possible. All BFRsin a BIER network, not just the BFERs, also flood their routeridentifier, which is used in building network topology and unicastforwarding tables. BFRs, in one embodiment, advertise additionalinformation as well, such as a bit string size that the BFR isconfigured to use. Adding such BIER information to the advertisedinformation is a relatively small amount of additional information, ascompared with the state information maintained on a per-group basis intraditional multicast.

FIG. 3B shows an example method of utilizing overlay signaling todistribute membership information in a BIER network. In one embodiment,the method of FIG. 3B is performed by a BFER, such as BFR 214 of FIG. 2.

At 322, the BFER receives a membership request from a host, such as host203 of FIG. 2. The membership request is optionally relayed through acustomer edge node, such as customer edge node 213 of FIG. 2. In oneembodiment, the membership request comprises an IGMP message. Themembership request includes information identifying a multicast group,and information identifying whether the host wishes to join, e.g.subscribe, or leave, e.g. unsubscribe from, the multicast group. Inresponse to receiving the membership request, the BFER updatesforwarding information indicating the host's membership in the multicastgroup. For example, if the membership request indicates that the hostwishes to join multicast group G1, the BFER updates a forwarding entrysuch that any multicast data packets received by the BFER and addressedto multicast group G1 will be forwarded to the host by the BFER.

At 324, the BFER generates a membership message. The membership messagesignals the BFER's interest in the multicast group. The membershipmessage carries information identifying the multicast group, andinformation identifying the BFER, such as the set identifier and bitposition of the BFER. In one embodiment, the membership message isimplemented using BGP.

At 326, the BFER transmits the membership message. In one embodiment,transmitting the membership message involves forwarding the membershipmessage to a multicast domain controller, such as MDC 230 of FIG. 2. TheMDC then transmits the membership message to some or all of the edgerouters in the BIER domain, e.g., BFERs and BFIRs.

Bit Indexed Routing and Forwarding Tables

Each BFR in the BIER network uses the advertised BPs and routeridentifiers of the other BFRs to generate one or more bit indexedrouting tables (BIRTs) and bit indexed forwarding tables (BIFTs). TheBFRs use these tables to forward multicast data packets. A bit indexedrouting table, as shown by example BIRT 262 of FIG. 2, is a table thatstores BP-to-router identifier mappings, e.g., as learned via aninterior gateway protocol (IGP). Each BFR receives BP-to-routeridentifier mappings and stores them in a BIRT. Using the routeridentifiers, a BFR performs a recursive lookup in unicast routing tablesto identify a directly connected next hop BFR (referred to herein as aneighbor (NBR)) on the shortest path from the BFR toward the BFRassociated with the BP. In one embodiment, the NBR is the next hop on ashortest path (SPT) towards the BFER that advertised the BP. In oneembodiment, the BIRT includes one entry per BP.

Each BFR translates its BIRT(s) into one or more bit indexed forwardingtables (BIFTs). FIG. 2 illustrates an example BIFT 264 as generated byBFR B 208. Generating a BIFT involves, in one embodiment, first sortingthe BIRT by neighbor. For entries in the BIRT that have a common NBR,the bit masks of those entries are OR'd together, creating a bit maskthat is a combination of the BPs from those entries. This is referred toherein as a forwarding bit mask (FBM) or just bit mask (BM). Each bitthat is set in the FBM indicates a BFER that is reachable via theneighbor associated with the FBM in the BIFT. Multicast data packets areforwarded by the BFRs using the BIFTs. For example, according to BIFT264, if a multicast data packet having a bit string with BP {0001} setarrives at node 208, the multicast data packet should be forwarded toNBR C (BFR 210 in the example of FIG. 2). If a multicast data packetarrives at node 208 having a BP of {1000} set, the multicast data packetshould be forwarded to NBR A (BFR 206 in the example of FIG. 2). If amulticast data packet arrives at node 208 having a bit string of {1001},the multicast data packet should be forwarded to both NBR A and NBR C.

Packet Forwarding

After encapsulating the bit string into a multicast data packet, theBFIR forwards the multicast data packet to one or more BFRs using theBFIR's BFTS(s). A BFR that receives the multicast data packetdetermines, using the bit string in the multicast data packet and theBFR's own BFT(s), whether to forward the multicast data packet to one ormore of its neighbors, and if so, to which one(s). To do so, the BFRcompares the bit string in the multicast data packet with the FBMentries in the BFR's BFT. In one embodiment, the BFR performs a logicalAND operation between the multicast data packet's bit string and theFBMs in the BFR's BFT. As noted, the BFR's BFT includes, in oneembodiment, an entry for each neighbor of the BFR, and each entryincludes a FBM field that indicates which BFERs are reachable along ashortest path via the neighbor identified in the entry. If the result ofthe AND is TRUE for a given neighbor, the BFR forwards the multicastdata packet to that neighbor. A TRUE result indicates that an FBM for agiven neighbor in the BFR's BIFT has one or more bits set to 1 and thata corresponding bit (or bits) in the multicast data packet's bit stringis also set to 1. The set bits in the multicast data packet's bit stringindicate which BFERs have expressed interest in the multicast group, andthe set bit in the BFR's BIFT entry indicates that the BFER that hasexpressed interest is reachable via the neighbor indicated in the entry.A BFR forwards a multicast data packet that contains a bit string to allneighbors for which the bit-wise AND operation between the bit string inthe multicast data packet and the FBMs in the BFR's BIFT is TRUE.

In the example of FIG. 2, BFR 214 (a BFER) signals to BFR 206 (a BFIR)that BFR 214 is interested in receiving packets associated with a givenmulticast group or flow. BFRs 216 and 218 likewise signal BFR 206 thatBFRs 216 and 218 are interested in the same multicast group. Thesignaling is represented by the dashed lines shown in FIG. 2. BFR 206updates an entry in group membership table 224 (or creates one if onedoes not already exist) for the multicast group and updates a bit stringin the entry by setting bits corresponding to BFRs 214, 216, and 218.The bit string corresponding to the multicast group is {0111}.

BFR 206 is configured to receive a multicast data packet addressed tothe multicast group or flow (e.g., from source 201 via CE node 211). BFR206 uses the multicast group address and/or source address included inthe multicast data packet to access its GMT and select a bit stringassociated with the multicast group. After selecting a bit string thatcorresponds to the multicast group from the GMT, BFR 206 encapsulatesthe bit string for that multicast group into the multicast data packetand identifies the neighbors to which the packet will be forwarded(e.g., using its BFT). In one embodiment, this involves performing anAND operation between the bit string and each entry in BFR 206's BIFT.In this example, there is only one entry in the BIFT and the entrycorresponds to BFR 208. This means that the shortest path from BFR 206to all three of the BFERs in network 200 runs through BFR 208. Since theresult of the AND is TRUE for NBR B (BFR 208), BFR 206 forwards themulticast data packet to BFR 208. BFR 206 also modifies the bit stringin the multicast data packet it forwards, as discussed below.

In response to receiving the multicast data packet, BFR 208 performs anAND between the bit string in the multicast data packet, {0111}, andeach entry in its BIFT (shown at 264). The result for NBR C is TRUE soBFR 208 forwards the multicast data packet to BFR 210. BFR 208 alsomodifies the bit string in the multicast data packet it forwards byclearing bits corresponding to BFERs that are not reachable via ashortest path through the BFR to which the multicast data packet isbeing forwarded. In this example, since node E is not reachable via nodeC (the shortest path from node B to node E does not run through C), nodeB clears the bit corresponding to node E in the bit string that node Bforwards to node C, Thus, node B forwards the multicast data packet tonode C with the bit string {0011}. The result for NBR E is also TRUE, soBFR 208 replicates the multicast data packet and forwards the multicastdata packet to BFR 216, which is a BFER. Node B updates the bit stringand forwards the multicast data packet to node E with the bit string{0100}.

BFR 210, in response to receiving the multicast data packet, performs anAND between the bit string in the multicast data packet, {0011}, andeach entry in its BIFT and forwards the multicast data packet to BFR 214which is a BFER and BFR 218.

Packet Forwarding Optimization

The process of packet forwarding described above involves a BFRcomparing the bit string in an incoming multicast data packet with eachentry in the BFR's BIFT. Each entry in the BFR's BIFT corresponds to aneighbor, so if the number of neighbors is large, the number ofcomparisons the BFR performs also is large. Each comparison represents aburden on computing resources and takes time. Therefore, it is desirableto reduce the number of comparisons involved in forwarding a multicastdata packet. An embodiment, described below with regard to FIGS. 4-9,involves optimizing the forwarding information maintained by BFRs in aBIER network. A BFR generates a BIFT that is indexed by bit position,rather than by neighbor. The BFR compares the bit string with BIFTentries that correspond to set bits in the bit string. In thisembodiment, the maximum number of comparisons the BFR performs islimited by the length of the bit string.

FIG. 4 is a simplified block diagram illustrating certain components ofan example network 400. BFRs 410 through 416 are BFRs configured astransit nodes. BFRs 420 through 428 are BFRs configured as BFERs, withbit positions assigned as shown. BFR 406 is a BFR that can be either atransit node or a BFIR. In one embodiment, the BFRs are coupled directlyor indirectly to one or more hosts (not shown), e.g., via one or morecustomer edge nodes (not shown). Network 400 also includes, optionally,a multicast data controller (not shown).

FIG. 5A shows an example bit indexed routing table (BIRT) 502 generatedby a BFR, such as node 406 of FIG. 4. BIRT 502 includes three columns.The first column includes information identifying a bit position and(optionally) set identifier for each BFER included in the BIER network(e.g., network 400 of FIG. 4). In one embodiment, the column alsoincludes a BFR-ID, or virtual bit position. A virtual bit position canbe implemented as an integer unique among the edge routers in a BIERnetwork. The second column includes a routable address for each of theBFERs, such as a loopback or BFR prefix. The third column includes oneor more entries for the neighbor or neighbors via which the BFERcorresponding to the BFR prefix is reachable along a shortest path fromthe node.

FIG. 5B is an example bit indexed forwarding table (BIFT) 504 generated,for example, by a BFR, such as node 406 of FIG. 4. BIFT 504 is generatedusing the information in BIRT 502 of FIG. 5A. As shown, BIFT 504includes a column for a forwarding bit mask (FBM). BIFT 504 alsoincludes a column for neighbors. BIFT 504 indicates that each of the bitpositions set in the FBM is reachable via the corresponding neighbor.For example, considering the first entry in BIFT 504, BFERscorresponding to bit positions 1, 2, and 3 are reachable via neighbor A.BIFT 504 is indexed by NBR. That is, there is an entry corresponding toeach of the BFRs neighbors.

FIG. 5C is an example optimized bit indexed forwarding table (BIFT) 506generated, for example, by a BFR, such as node 406 of FIG. 4. OptimizedBIFT 506 is indexed by bit position. Optimized BIFT 506 includes anentry for each bit position in a BIER network. The entry includesinformation identifying the BP, a neighbor via which a BFERcorresponding to the BP is reachable, and a forwarding bit mask thatidentifies all BFERs reachable via the neighbor. A BFR generates anoptimized BIFT using the BFR's BIRT and/or the BFRs BIFT (e.g., the BFRcan convert a non-optimized BIFT into an optimized BIFT).

FIG. 6 is a flow chart illustrating an example process for generating abit indexed routing table (BIRT). In one embodiment, FIG. 6 is performedby a BFR, such as BFR 406 of FIG. 4.

At 602, the BFR receives an advertisement generated by a BFER. In oneembodiment, the advertisement is received via IGP and includesinformation identifying a mapping between a routable address associatedwith the BFER, such as a router identifier, and a bit position and setidentifier associated with the BFER. In response to receiving theadvertisement, the BFR determines, at 604, the BFR ID and BP included inthe advertisement. The BFR also determines the set identifier, if one isincluded in the advertisement.

At 606, the BFR accesses its stored topology information to determinethe next hop neighbor along the shortest path towards the BFER thatgenerated the advertisement. At 608, the BFR updates the BIRT. In oneembodiment, this comprises adding an entry that includes informationidentifying the BFR ID and bit position of the BFER, as well as theneighbor via which the BFER is reachable.

FIG. 7 is a flow chart illustrating an example process of generating aBIFT employed by a node, according to the present description. In oneembodiment, FIG. 7 is performed by a BFR, such as BFR 406 of FIG. 4.Performance of the process of FIG. 7 produces a BIFT that is indexed byneighbor.

At 702, the BFR selects a first entry in the BFR's BIRT. At 704, the BFRdetermines the neighbor and bit position associated with the entry. Inone embodiment, the neighbor is identified by a BFR ID or prefix orother information. At 706, the BFR determines whether the BIFT alreadyincludes an entry for the neighbor. In the case of the first BIRT entry,there is not a BIFT entry corresponding to the neighbor identifying theBIRT entry. If there is no BIFT entry corresponding to the neighbor, theBFR creates a BIFT entry corresponding to the neighbor at 708. At 710,the BFR sets a bit in the FBM corresponding to the bit positionidentified in the BIRT entry. At 712, the BFR determines whether theBIRT includes additional entries. If so, the BFR selects the next BIRTentry at 714.

For example, considering BIRT 502 of FIG. 5A, at 702, the BFR selectsthe first entry in the BIRT. In the example of BIRT 502, the first entrycorresponds to neighbor D. Next, the BFR determines, at 704, that theentry corresponds to BP 5. That is, node P, which has been assigned BP5, is reachable from the BFR via node D. Next, the BFR creates an entryin its BIFT for neighbor D and sets the fifth bit in the FBM. FIG. 5Bshows that the entry corresponding to neighbor D has bit 5 set. The BFRthen proceeds iteratively, next selecting the entry for neighbor Ccorresponding to bit 4, and so on. After performing the method shown inFIG. 7, the BFR will have a BIFT that includes one entry for each of theBFR's neighbors and each entry will include a forwarding bit mask thathas a bit set for each of the BFERs reachable that neighbor.

FIG. 8A is a flow chart illustrating an example process of generating aBIFT employed by a node, according to the present description. In oneembodiment, FIG. 8A is performed by a BFR, such as BFR 406 of FIG. 4.FIG. 8A depicts a method of converting an existing non-optimized BIFT,as shown in FIG. 5B into an optimized BIFT, as shown in FIG. 5C.

At 804, the BFR selects the first BP in a bit string. In one embodiment,the bit string corresponds to a forwarding bit mask in the non-optimizedBIFT. At 806, the BFR creates an entry corresponding to the selected BPin the optimized BIFT. In one embodiment, the entry includes a field forinformation identifying a neighbor via which a BFER corresponding to theBP is reachable and an FBM, which is initially blank, but which the BFRupdates to include information identifying which bit positions arereachable via the neighbor.

At 810, the BFR selects the first entry in the non-optimized BIFT. At812, the BFR determines whether the selected bit position includes a setbit in the FBM of the non-optimized BIFT. If not, the BFR determines, at818, whether the non-optimized BIFT includes more entries. If so, theBFR selects the next entry in the non-optimized BIFT and determines ifthe selected bit position is set. This continues until either thenon-optimized BIFT has been completely traversed or the BFR locates anentry that has the selected BP set.

In response to detecting an entry with the corresponding BP set, the BFRcopies, at 814, information identifying the neighbor from thecorresponding entry in the non-optimized BIFT to the entry in theoptimized BIFT. The BFR also copies, at 816, the FBM from the entry inthe non-optimized BIFT to the entry in the optimized BIFT. At 822, theBFR determines whether additional BPs remain in the bit string. If so,the BFR selects the next BP, at 824. The method continues until all BPsin the bit string have been processed.

Using the example of FIG. 5B, the BFR selects the first bit in the FBM(BP=1) and determines if the first bit is set in the first entry of BIFT504, e.g., the entry corresponding to NBR A. Since BP=1 is not set inthe entry corresponding to NBR A, the BFR selects the second entry, andthen the third entry, which also do not include a set bit at BP=1. Inresponse to determining that the fourth entry, corresponding to NBR D,has the first bit set, the BFR updates the entry in the optimized BIFT(e.g., BIFT 506 of FIG. 5C) to include the FBM and NBR information fromthe fourth entry of BIFT 504. BFR then selects the next bit in the bitstring (BP=2) and repeats the process iteratively until all bits in thebit string have been processed.

FIG. 8B is a flow chart illustrating an example process of generating aBIFT employed by a node, according to the present description. In oneembodiment, FIG. 8B is performed by a BFR, such as BFR 406 of FIG. 4.FIG. 8B depicts a method of generating an optimized BIFT, as shown, forexample, in FIG. 5C, using information from the BFR's BIRT, as shown,for example, in FIG. 5A. In this embodiment, the optimized BIFT can begenerated even if no existing BIFT is present.

At 850, the BFR selects the first BP in a bit string. At 852, the BFRcreates an entry corresponding to the selected BP in the optimized BIFT.In one embodiment, the entry includes a field for informationidentifying a neighbor via which a BFER corresponding to the BP isreachable and an FBM, which is initially blank, but which the BFRupdates to include information identifying which bit positions arereachable via the neighbor.

The BFR locates, at 854, an entry corresponding to the selected BP inthe BIRT. At 856, the BFR determines the neighbor associated with the BPin the entry. The BFR updates, at 858, the neighbor field in theoptimized BIFT entry corresponding to the BP. This indicates that the BP(e.g., the BFER assigned the BP) is reachable via the neighbor. The BFRalso sets the bit corresponding to the BIRT entry, if the bit is notalready set.

At 860, the BFR determines whether the BIRT includes additional entriesthat are associated with the neighbor. If so, the BFR selects the nextentry and updates the optimized BIFT entry. In one embodiment, updatingthe BIFT entry involves setting the bit associated with the BIRT entry.

The BFR determines, at 864, whether additional bit positions remain inthe bit string. If so, the BFR selects the next bit position in the bitstring at 866. The method continues until all bit positions in the bitstring are processed.

Using the example of BIRT 502, the BFR first selects BP=1 and locates anentry in BIRT 502 corresponding to BP=1. BIRT 502 indicates that BP=1 isreachable via node D. The BFR updates the entry corresponding to BP=1with information identifying node D and setting the bit in BP=1. The BFRthen determines if BIRT 502 includes any other entries corresponding tonode D. The entry corresponding to BP=5 in BIRT 502 indicates that nodeD is the next hop via which the BFER corresponding to BP=5 is reachable.In response to determining that there is another entry corresponding tonode D, the BFR updates the FBM corresponding to BP=1 in the optimizedBIFT by setting the bit corresponding to BP=5.

FIG. 9 is a flow chart illustrating an example process of forwarding amulticast data packet employed by a node, according to the presentdescription. In one embodiment, the process is performed by a BFR, suchas BFR 406 of FIG. 4. The process of FIG. 9 can be performed using anoptimized BIFT, such as optimized BIFT 506 of FIG. 5C.

At 902, the BFR receives a multicast data packet. The BFR determines, at904, whether the multicast data packet is a BIER packet, and thereforeincludes a bit string. In one embodiment, the BFR scans the header ofthe multicast data packet for a value that indicates that the multicastdata packet is a BIER packet. The BFR can detect that the sender of themulticast data packet was a BFR and therefore conclude that themulticast data packet is a BIER multicast data packet. If the multicastdata packet is not a BIER multicast data packet, the BFR performsalternate processing at 906. In one embodiment, alternate processing 906involves flooding the multicast data packet to all interfaces on theBFR, or dropping the multicast data packet. Alternatively, iftraditional multicast forwarding information is available, the BFR canuse that information to forward the packet.

If the multicast data packet is a BIER multicast data packet, the BFRknows that the multicast data packet includes a bit string. The BFRlocates the bit string in the multicast data packet at 908. Using thebit string, the BFR determines which neighbors the multicast data packetshould be forwarded to. In one embodiment, this involves selecting, asshown at 910, the first bit of the bit string and determining, as shownat 912, whether the first bit of the bit string is set. If the bit isnot set, the BFR determines, at 922, whether more bits are present inthe bit string. If so, the BFR selects the next bit at 924 and themethod returns to 912.

In response to determining, at 912, that a bit in the bit string is set,the BFR selects, at 914, a forwarding entry with which to forward themulticast data packet. In one embodiment, the BFR selects a forwardingentry that corresponds to the bit position of the set bit in the bitstring. At 916, the BFR creates a copy of the multicast data packet andupdates the bit string in the copy of the multicast data packet.Updating the bit string in the copy of the multicast data packetinvolves clearing bits in the bit string that correspond to neighborsthat are not reachable via a shortest path from the interface to whichthe copy of the packet is being forwarded. This can be accomplished byperforming an AND operation between the bit string from the incomingmulticast data packet and the bit mask in the forwarding table entrythat corresponds to the selected bit. The resulting value is used as thebit string for the copy of the multicast data packet. At 918, the BFRforwards the multicast packet to the interface.

At 920, the BFR updates the bit string that arrived in the multicastdata packet by clearing those bits in the multicast data packet's bitstring that correspond to the bits which were set in the multicast datapacket that the BFR forwarded. In one embodiment, this involvesperforming an AND operation between the bit string in the receivedmulticast data packet, and the inverse of the bit mask in the entrycorresponding to the selected bit. This has the effect of clearing thosebits that correspond to bit positions which were set in the bit stringof the outgoing packet, which prevents looping and duplication. The BFRthen determines, at 922, whether more bits are present in the bitstring. If so, the BFR selects the next bit at 924. The BFR continues towalk to the bit string of the received multicast data packet,bit-by-bit, until the end of the bit string is reached.

FIG. 10 is a block diagram illustrating certain additional and/oralternative components of nodes that can be employed, for example in thenetwork shown in FIG. 3. In this depiction, node 1000 includes a numberof line cards (line cards 1002(1)-(N)) that are communicatively coupledto a forwarding engine or packet forwarder 1010 and a processor 1020 viaa data bus 1030 and a result bus 1040. Line cards 1002(1)-(N) include anumber of port processors 1050(1,1)-(N,N) which are controlled by portprocessor controllers 1060(1)-(N). It will also be noted that forwardingengine 1010 and processor 1020 are not only coupled to one another viadata bus 1030 and result bus 1040, but are also communicatively coupledto one another by a communications link 1070.

The processors 1050 and 1060 of each line card 1002 may be mounted on asingle printed circuit board. When a packet or packet and header arereceived, the packet or packet and header may be identified and analyzedby router 1000 in the following manner upon receipt, a packet (or someor all of its control information) or packet and header is sent from theone of port processors 1050(1,1)-(N,N) at which the packet or packet andheader was received to one or more of those devices coupled to data bus1030 (e.g., others of port processors 1050(1,1)-(N,N), forwarding engine1010 and/or processor 1020). Handling of the packet or packet and headercan be determined, for example, by forwarding engine 1010. For example,forwarding engine 1010 may determine that the packet or packet andheader should be forwarded to one or more of port processors1050(1,1)-(N,N). This can be accomplished by indicating to correspondingone(s) of port processor controllers 1060(1)-(N) that the copy of thepacket or packet and header held in the given one(s) of port processors1050(1,1)-(N,N) should be forwarded to the appropriate one of portprocessors 1050(1,1)-(N,N). In addition, or alternatively, once a packetor packet and header has been identified for processing, forwardingengine 1010, processor 1020 or the like can be used to process thepacket or packet and header in some manner or add packet securityinformation, in order to secure the packet. On a node sourcing such apacket or packet and header, this processing can include, for example,encryption of some or all of the packet's or packet and header'sinformation, the addition of a digital signature or some otherinformation or processing capable of securing the packet or packet andheader. On a node receiving such a processed packet or packet andheader, the corresponding process is performed to recover or validatethe packet's or packet and header's information that has been thuslyprotected.

FIG. 11 is a block diagram of a computing device, illustrating how aforwarding module can be implemented in software, as described above.Computing system 1110 broadly represents any single or multi-processorcomputing device or system capable of executing computer-readableinstructions. Examples of computing system 1110 include, withoutlimitation, any one or more of a variety of devices includingworkstations, personal computers, laptops, client-side terminals,servers, distributed computing systems, handheld devices (e.g., personaldigital assistants and mobile phones), network appliances, switches,routers, storage controllers (e.g., array controllers, tape drivecontroller, or hard drive controller), and the like. In its most basicconfiguration, computing system 1110 may include at least one processor1114 and a system memory 1116. By executing the software that implementsa forwarding module 1117, computing system 1110 becomes a specialpurpose computing device that is configured to perform packetforwarding, in the manner described above.

Processor 1114 generally represents any type or form of processing unitcapable of processing data or interpreting and executing instructions.In certain embodiments, processor 1114 may receive instructions from asoftware application or module. These instructions may cause processor1114 to perform the functions of one or more of the embodimentsdescribed and/or illustrated herein. For example, processor 1114 mayperform and/or be a means for performing the operations describedherein. Processor 1114 may also perform and/or be a means for performingany other operations, methods, or processes described and/or illustratedherein.

System memory 1116 generally represents any type or form of volatile ornon-volatile storage device or medium capable of storing data and/orother computer-readable instructions. Examples of system memory 1116include, without limitation, random access memory (RAM), read onlymemory (ROM), flash memory, or any other suitable memory device.Although not required, in certain embodiments computing system 1110 mayinclude both a volatile memory unit (such as, for example, system memory1116) and a non-volatile storage device (such as, for example, primarystorage device 1132, as described in detail below). In one example,program instructions executable to implement a forwarding moduleconfigured to forward multicast data packets may be loaded into systemmemory 1116.

In certain embodiments, computing system 1110 may also include one ormore components or elements in addition to processor 1114 and systemmemory 1116. For example, as illustrated in FIG. 11, computing system1110 may include a memory controller 1118, an Input/Output (I/O)controller 1120, and a communication interface 1122, each of which maybe interconnected via a communication infrastructure 1112. Communicationinfrastructure 1112 generally represents any type or form ofinfrastructure capable of facilitating communication between one or morecomponents of a computing device. Examples of communicationinfrastructure 1112 include, without limitation, a communication bus(such as an Industry Standard Architecture (ISA), Peripheral ComponentInterconnect (PCI), PCI express (PCIe), or similar bus) and a network.

Memory controller 1118 generally represents any type or form of devicecapable of handling memory or data or controlling communication betweenone or more components of computing system 1110. For example, in certainembodiments memory controller 1118 may control communication betweenprocessor 1114, system memory 1116, and I/O controller 1120 viacommunication infrastructure 1112. In certain embodiments, memorycontroller 1118 may perform and/or be a means for performing, eitheralone or in combination with other elements, one or more of theoperations or features described and/or illustrated herein.

I/O controller 1120 generally represents any type or form of modulecapable of coordinating and/or controlling the input and outputfunctions of a computing device. For example, in certain embodiments I/Ocontroller 1120 may control or facilitate transfer of data between oneor more elements of computing system 1110, such as processor 1114,system memory 1116, communication interface 1122, display adapter 1126,input interface 1130, and storage interface 1134.

Communication interface 1122 broadly represents any type or form ofcommunication device or adapter capable of facilitating communicationbetween computing system 1110 and one or more additional devices. Forexample, in certain embodiments communication interface 1122 mayfacilitate communication between computing system 1110 and a private orpublic network including additional computing systems. Examples ofcommunication interface 1122 include, without limitation, a wirednetwork interface (such as a network interface card), a wireless networkinterface (such as a wireless network interface card), a modem, and anyother suitable interface. In at least one embodiment, communicationinterface 1122 may provide a direct connection to a remote server via adirect link to a network, such as the Internet. Communication interface1122 may also indirectly provide such a connection through, for example,a local area network (such as an Ethernet network), a personal areanetwork, a telephone or cable network, a cellular telephone connection,a satellite data connection, or any other suitable connection.

In certain embodiments, communication interface 1122 may also representa host adapter configured to facilitate communication between computingsystem 1110 and one or more additional network or storage devices via anexternal bus or communications channel Examples of host adaptersinclude, without limitation, Small Computer System Interface (SCSI) hostadapters, Universal Serial Bus (USB) host adapters, Institute ofElectrical and Electronics Engineers (IEEE) 11054 host adapters, SerialAdvanced Technology Attachment (SATA) and external SATA (eSATA) hostadapters, Advanced Technology Attachment (ATA) and Parallel ATA (PATA)host adapters, Fibre Channel interface adapters, Ethernet adapters, orthe like.

Communication interface 1122 may also allow computing system 1110 toengage in distributed or remote computing. For example, communicationinterface 1122 may receive instructions from a remote device or sendinstructions to a remote device for execution.

As illustrated in FIG. 11, computing system 1110 may also include atleast one display device 1124 coupled to communication infrastructure1112 via a display adapter 1126. Display device 1124 generallyrepresents any type or form of device capable of visually displayinginformation forwarded by display adapter 1126. Similarly, displayadapter 1126 generally represents any type or form of device configuredto forward graphics, text, and other data from communicationinfrastructure 1112 (or from a frame buffer) for display on displaydevice 1124.

As illustrated in FIG. 11, computing system 1110 may also include atleast one input device 1128 coupled to communication infrastructure 1112via an input interface 1130. Input device 1128 generally represents anytype or form of input device capable of providing input, either computeror human generated, to computing system 1110. Examples of input device1128 include, without limitation, a keyboard, a pointing device, aspeech recognition device, or any other input device.

As illustrated in FIG. 11, computing system 1110 may also include aprimary storage device 1132 and a backup storage device 1133 coupled tocommunication infrastructure 1112 via a storage interface 1134. Storagedevices 1132 and 1133 generally represent any type or form of storagedevice or medium capable of storing data and/or other computer-readableinstructions. For example, storage devices 1132 and 1133 may be amagnetic disk drive (e.g., a so-called hard drive), a floppy disk drive,a magnetic tape drive, an optical disk drive, a flash drive, or thelike. Storage interface 1134 generally represents any type or form ofinterface or device for transferring data between storage devices 1132and 1133 and other components of computing system 1110. A storage devicelike primary storage device 1132 can store information such as routingtables and forwarding tables.

In certain embodiments, storage devices 1132 and 1133 may be configuredto read from and/or write to a removable storage unit configured tostore computer software, data, or other computer-readable information.Examples of suitable removable storage units include, withoutlimitation, a floppy disk, a magnetic tape, an optical disk, a flashmemory device, or the like. Storage devices 1132 and 1133 may alsoinclude other similar structures or devices for allowing computersoftware, data, or other computer-readable instructions to be loadedinto computing system 1110. For example, storage devices 1132 and 1133may be configured to read and write software, data, or othercomputer-readable information. Storage devices 1132 and 1133 may also bea part of computing system 1110 or may be a separate device accessedthrough other interface systems.

Many other devices or subsystems may be connected to computing system1110. Conversely, all of the components and devices illustrated in FIG.11 need not be present to practice the embodiments described and/orillustrated herein. The devices and subsystems referenced above may alsobe interconnected in different ways from that shown in FIG. 11.

Computing system 1110 may also employ any number of software, firmware,and/or hardware configurations. For example, one or more of theembodiments disclosed herein may be encoded as a computer program (alsoreferred to as computer software, software applications,computer-readable instructions, or computer control logic) on acomputer-readable storage medium. Examples of computer-readable storagemedia include magnetic-storage media (e.g., hard disk drives and floppydisks), optical-storage media (e.g., CD- or DVD-ROMs),electronic-storage media (e.g., solid-state drives and flash media), andthe like. Such computer programs can also be transferred to computingsystem 1110 for storage in memory via a network such as the Internet orupon a carrier medium.

The computer-readable medium containing the computer program may beloaded into computing system 1110. All or a portion of the computerprogram stored on the computer-readable medium may then be stored insystem memory 1116 and/or various portions of storage devices 1132 and1133. When executed by processor 1114, a computer program loaded intocomputing system 1110 may cause processor 1114 to perform and/or be ameans for performing the functions of one or more of the embodimentsdescribed and/or illustrated herein. Additionally or alternatively, oneor more of the embodiments described and/or illustrated herein may beimplemented in firmware and/or hardware. For example, computing system1110 may be configured as an application specific integrated circuit(ASIC) adapted to implement one or more of the embodiments disclosedherein.

FIG. 12 is a block diagram of an exemplary network device that may beassociated with a node in network 200 of FIG. 2. Network device 1250 ofFIG. 12 may, for example, be associated with BIER-enabled node 206 inFIG. 2. In some cases “node” as used herein encompasses one or morenetwork devices associated with the node. “Network devices” as usedherein includes various devices, such as routers, switches, or networkcontrollers that perform routing and/or forwarding functions and supportone or more routing and/or switching protocols. A network devicemaintains one or more routing and/or forwarding tables that storerouting and/or forwarding information identifying paths to various datasources and/or data consumers. In, for example, a multicast-enablednode, a network device implements a multicast routing protocol that isused to convey multicast data packets from a multicast source to amulticast receiver.

In the embodiment of FIG. 12, network device 1250 includes storage formembership information 1252, storage for forwarding information 1264, aforwarding module 1260, and an interface 1262. Interface 1262 is coupledto send and receive packets and/or other network messages. It is notedthat network device 1250 may include additional interfaces, and thateach interface can be a logical or physical interface. In oneembodiment, interface 1262 includes one or more ports.

Forwarding module 1260 is configured to perform forwarding based on thestored forwarding information 1264. Forwarding module 1260 is alsoconfigured to update the stored membership information 1252 andforwarding information 1264. Forwarding module 1260 can implement one ormore instances of a layer 3 protocol and/or a layer 2 protocol.

Entry 1270 provides an example of membership information stored inmemory of a network device. As shown, entry 1270 includes a setidentifier 1254, information 1256 identifying a bit position (BP), andinformation 1258 identifying a multicast group. The SI and BP identify anode with which entry 1270 is associated, and the multicast groupinformation identifies a multicast group to which the corresponding nodeis subscribed. The storage for membership information 1252 is, in oneembodiment, implemented as a group membership table.

Entry 1272 provides an example of forwarding information that can bestored in memory of a network device. As shown, entry 1272 includesinformation 1266 identifying a BP, a bit string or bit array 1268, andinformation 1269 identifying a neighbor. Forwarding module 1260 uses theinformation in entry 1272 to forward multicast data packets to theinterface associated with the neighbor identified in the entry. Thestorage for forwarding information 1264 is, in one embodiment,implemented as a bit indexed forwarding table (BIFT).

Although the present invention has been described in connection withseveral embodiments, the invention is not intended to be limited to thespecific forms set forth herein. On the contrary, it is intended tocover such alternatives, modifications, and equivalents as can bereasonably included within the scope of the invention as defined by theappended claims.

What is claimed is:
 1. A method comprising: generating a bit-indexedforwarding table (BIFT) at a first network node, wherein the BIFTcomprises a plurality of entries, each entry of which corresponds to abit position of a plurality of bit positions, each bit position of theplurality of bit positions represents an egress network node of aplurality of egress network nodes, the BIFT is generated from a bitindexed routing table (BIRT), comprising a plurality of BIRT entries,and is configured to cause a packet to be forwarded toward one or moreof the plurality of egress network nodes, based at least in part on abit string in the packet, and the generating comprises selecting a bitposition of the plurality of bit positions as a selected bit position,creating an entry of the plurality of entries, wherein the entrycorresponds to the selected bit position, accessing a BIRT entry of theplurality of BIRT entries, wherein the BIRT entry corresponds to theselected bit position, updating a neighbor information field of theentry, using neighbor information from the BIRT entry, wherein theneighbor information is associated with a neighbor network node, and theneighbor network node is a neighbor of the first network node, andupdating forwarding bit mask information in a forwarding bit mask fieldof the entry, using bit position information from the BIRT entry.
 2. Themethod of claim 1, wherein the entry comprises a bit positionidentifying information field, the forwarding bit mask field, and theneighbor information field.
 3. The method of claim 1, wherein: one ormore egress network nodes of the plurality of egress network nodes are adestination of the packet.
 4. The method of claim 1, wherein: theforwarding bit mask information is configured to represent one or moreegress nodes that are reachable from the neighbor network node.
 5. Themethod of claim 4, wherein: the neighbor information comprises anidentifier that identifies the neighbor network node.
 6. The method ofclaim 1, wherein the BIRT comprises one or more mappings that map theeach bit position to at least one of the plurality of egress networknodes.
 7. The method of claim 1, wherein the updating the forwarding bitmask information comprises: generating the forwarding bit maskinformation, wherein the generating the forwarding bit mask informationcomprises sorting the plurality of BIRT entries, wherein the pluralityof BIRT entries are sorted by a neighbor network node associated witheach BIRT entry, selecting one or more selected BIRT entries, whereineach of the one or more selected BIRT entries is associated with acommon neighbor network node, and generating combined entry informationby combining the one or more selected BIRT entries, wherein the updatingthe neighbor information field and the updating the forwarding bit maskinformation are performed using the combined entry information.
 8. Themethod of claim 1, wherein the accessing further comprises: selectingthe BIRT entry based on a value of a bit in the selected bit position;identifying a neighbor network node associated with the BIRT entry;determining whether a destination network node is reachable from thefirst network node via the neighbor network node associated with theBIRT entry; and in response to the destination network node beingreachable from the first network node via the neighbor network nodeassociated with the BIRT entry, setting a bit in a forwarding bit maskassociated with the selected bit position, wherein the forwarding bitmask information comprises the forwarding bit mask.
 9. The method ofclaim 1, wherein the accessing further comprises: selecting the BIRTentry; determining a neighbor network node associated with the BIRTentry; determining a bit position associated with the BIRT entry;determining whether the BIFT comprises an associated entry associatedwith the neighbor network node associated with the BIRT entry; if theBIFT does not comprise the associated entry, creating the associatedentry in the BIFT; and setting a bit in the associated entry, whereinthe bit in the associated entry corresponds to the bit positionassociated with the BIRT entry, and the setting the bit in theassociated entry represents that a destination network nodecorresponding to the bit position is reachable via the neighbor networknode associated with the BIRT entry.
 10. The method of claim 1, whereinas a result of the generating, the BIFT comprises a plurality ofentries, each of which corresponds to a corresponding bit position andstores a corresponding identifier of a plurality of identifiers.
 11. Themethod of claim 10, wherein the BIFT is indexed by bit position suchthat the BIFT allows a plurality of corresponding identifiers of theplurality of identifiers to correspond to another network node.
 12. Themethod of claim 1, further comprising: determining whether another BIRTentry of the plurality of BIRT entries is associated with the neighbornetwork node, wherein the another BIRT entry is associated with theneighbor network node if the neighbor information of the another BIRTentry identifies the neighbor network node.
 13. The method of claim 12,further comprising: in response to the another BIRT entry beingassociated with the neighbor network node, updating the forwarding bitmask information in the forwarding bit mask field of the entry, usingbit position information from the another BIRT entry.
 14. The method ofclaim 13, wherein the updating the forwarding bit mask information inthe forwarding bit mask field of the entry, using bit positioninformation from the another BIRT entry, comprises: setting a bit in theforwarding bit mask field of the entry, wherein the bit is associatedwith the another BIRT entry.
 15. A computer program product comprising:a non-transitory computer-readable storage medium; and a plurality ofinstructions, encoded in the non-transitory computer-readable storagemedium and comprising a first set of instructions, executable by aprocessor of a network device, configured to generate a bit-indexedforwarding table (BIFT) at a first network node, wherein the BIFTcomprises a plurality of entries, each entry of which corresponds to abit position of a plurality of bit positions, each bit position of theplurality of bit positions represents an egress network node of aplurality of egress network nodes, the BIFT is generated from a bitindexed routing table (BIRT), comprising a plurality of BIRT entries,and is configured to cause a packet to be forwarded toward one or moreof the plurality of egress network nodes, based at least in part on abit string in the packet, and the first set of instructions comprises afirst subset of instructions, executable by the processor of the networkdevice, configured to select a bit position of the plurality of bitpositions as a selected bit position, a second subset of instructions,executable by the processor of the network device, configured to createan entry of the plurality of entries, wherein  the entry corresponds tothe selected bit position, a third subset of instructions, executable bythe processor of the network device, configured to access a BIRT entryof the plurality of BIRT entries, wherein  the BIRT entry corresponds tothe selected bit position, a fourth subset of instructions, executableby the processor of the network device, configured to update a neighborinformation field of the entry, using neighbor information from the BIRTentry, wherein  the neighbor information is associated with a neighbornetwork node, and  the neighbor network node is a neighbor of the firstnetwork node, and a fifth subset of instructions, executable by theprocessor of the network device, configured to update forwarding bitmask information in a forwarding bit mask field of the entry, using bitposition information from the BIRT entry.
 16. The computer programproduct of claim 15, further comprising: a second set of instructions,executable by a processor of a network device, configured to determinewhether another BIRT entry of the plurality of BIRT entries isassociated with the neighbor network node, wherein the another BIRTentry is associated with the neighbor network node if the neighborinformation of the another BIRT entry identifies the neighbor networknode; and a third set of instructions, executable by a processor of anetwork device, configured to, in response to the another BIRT entrybeing associated with the neighbor network node, update the forwardingbit mask information in the forwarding bit mask field of the entry,using bit position information from the another BIRT entry.
 17. Anetwork device comprising: one or more hardware processors; one or morenetwork interfaces coupled to the one or more hardware processors,wherein the one or more network interfaces are configured to couple thenetwork device to a network; a non-transitory computer-readable storagemedium coupled to the one or more hardware processors; and a pluralityof instructions, encoded in the non-transitory computer-readable storagemedium and configured to cause the one or more hardware processors togenerate a bit-indexed forwarding table (BIFT), wherein the BIFTcomprises a plurality of entries, each entry of which corresponds to abit position of a plurality of bit positions, each bit position of theplurality of bit positions represents an egress network node of aplurality of egress network nodes, the BIFT is generated from a bitindexed routing table (BIRT), comprising a plurality of BIRT entries,and is configured to cause a packet to be forwarded toward one or moreof the plurality of egress network nodes, based at least in part on abit string in the packet, and the instructions configured to cause theone or more hardware processors to generate the BIFT comprise furtherinstructions configured to cause the one or more hardware processors toselect a bit position of the plurality of bit positions as a selectedbit position, create an entry of the plurality of entries, wherein  theentry corresponds to the selected bit position, access a BIRT entry ofthe plurality of BIRT entries, wherein  the BIRT entry corresponds tothe selected bit position, update a neighbor information field of theentry, using neighbor information from the BIRT entry, wherein  theneighbor information is associated with a neighbor network device, and the neighbor network device is a neighbor of the network device, andupdate forwarding bit mask information in a forwarding bit mask field ofthe entry, using bit position information from the BIRT entry.
 18. Thenetwork device of claim 17, wherein the plurality of instructionsconfigured to cause the one or more hardware processors to update theforwarding bit mask information are further configured to: generate theforwarding bit mask information, wherein the forwarding bit maskinformation is generated by sorting the plurality of BIRT entries,wherein the plurality of BIRT entries are sorted by a neighbor networknode associated with each BIRT entry, selecting one or more selectedBIRT entries, wherein each of the one or more selected BIRT entries isassociated with a common neighbor network node, and generating combinedentry information by combining the one or more selected BIRT entries,wherein the neighbor information field and the forwarding bit mask fieldare updated using the combined entry information.
 19. The network deviceof claim 17, wherein the plurality of instructions configured to causethe one or more hardware processors to update the forwarding bit maskinformation are further configured to: select the BIRT entry based on avalue of a bit in the selected bit position; identify a neighbor networknode associated with the BIRT entry; determine whether a destinationnetwork node is reachable from the network device via the neighbornetwork node associated with the BIRT entry; and in response to thedestination network node being reachable from the network device via theneighbor network node associated with the BIRT entry, set a bit in aforwarding bit mask associated with the selected bit position, whereinthe forwarding bit mask information comprises the forwarding bit mask.20. The network device of claim 17, wherein the plurality ofinstructions configured to cause the one or more hardware processors toupdate the forwarding bit mask information are further configured to:select the BIRT entry; determine a neighbor network node associated withthe BIRT entry; determine a bit position associated with the BIRT entry;determine whether the BIFT comprises an associated entry associated withthe neighbor network node associated with the BIRT entry; if the BIFTdoes not comprise the associated entry, create the associated entry inthe BIFT; and set a bit in the associated entry, wherein the bit in theassociated entry corresponds to the bit position associated with theBIRT entry, and the bit in the associated entry being set representsthat a destination network node corresponding to the bit position isreachable via the neighbor network node associated with the BIRT entry.